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Abstract 

The extraction of the weak phase a from B — > tttt decays has been 
controversial from a statistical point of view, as the frequentist vs. 
bayesian confrontation shows. We analyse several relevant questions 
which have not deserved full attention and pervade the extraction of a. 
Reparametrization Invariance proves appropriate to understand those 
issues. We show that some Standard Model inspired parametriza- 
tions can be senseless or inadequate if they go beyond the minimal 
Gronau and London assumptions: the single weak phase a just in the 
AI = 3/2 amplitudes, the isospin relations and experimental data. 
Beside those analyses, we extract a through the use of several ade- 
quate parametrizations, showing that there is no relevant discrepancy 
between frequentist and bayesian results. The most relevant informa- 
tion, in terms of a, is the exclusion of values around a ~ vr/4; this 
result is valid in the presence of arbitrary New Physics contributions 
to the AI = 1/2 piece. 
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1 Introduction 



The extraction of the CP violating phase a [1] has lead to some recent con- 
troversy confronting the results and statistical methods of two different col- 
laborations: the frequentist approach advocated in references [2,4] and the 
bayesian approach employed in reference [3]. In reference [2] J. Charles et al. 
presented an important criticism to the bayesian methods used by the UTfit 
collaboration in order to extract the angle a of the unitarity triangle b-d from 
7r7r and pp data. The criticism relies heavily on the statistical treatment of 
data: frequentist vs. bayesian. The answer of the UTfit collaboration [3] 
rises some interesting points, both on the interpretation of the results and 
on the importance of the physical assumptions on the hadronic amplitudes. 
The authors of [2] have recently answered to this UTfit reply in [4]. The 
aim of the present work is to clarify several issues central to an adequate 
understanding of the physics at stake. We also want to call the attention 
on the importance of reparametrization invariance (Rpl) in the sense intro- 
duced by F.J.B. and J. Silva in reference [5] to do so. We will not enter 
the polemic arena of statistical confrontation. With regard to this, we will 
instead illustrate the compatibility of results obtained in both approaches as 
long as things are done properly; notwithstanding, we will not ignore some 
"obscure" aspects of both approaches that are somehow swept under the 
rug as the statistical confrontation rages on, they illustrate that rather than 
sticking to one approach and deprecating the other it may be wiser to learn 
lessons from both. 

This work is organized as follows. We start section [2] with a short re- 
minder on reparametrization invariance and its implications, then we use the 
exclusion or inclusion of B — > 7r 7r° data together with Rpl to clarify the ori- 
gin of our knowledge on a. In section [3] we study critically Standard Model 
inspired parametrizations. We devote section H] to a detailed analysis of the 
impact on the results of allowed ranges for some parameters. The lessons 
from previous sections set up the stage for an adequate extraction of a, to 
which section E] is dedicated, especially in the presence of New Physics (NP) 
in loops. Several appendices deal with aspects left out of the main flow of 
the discussion. 
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2 Reparametrization invariance and B — > 7T7T 
2.1 Weak Phases 

We start this section with a short reminder of the findings presented in 
reference [5] concerning the parametrization of decay amplitudes and the 
election of weak phases. A generic parametrization of the decay amplitude 
of a B meson to a given final state and the CP-conjugate amplitude is the 
following^]: 

A — Mi e +i<pl e iSl + M 2 e +i(p2 e l&2 , 

A = Mi e - '* 1 e iSl + M 2 e~^ 2 e i5i , (1) 

where 4>j are CP-odd weak phases, 8j are CP-even strong phases and Mj 
the magnitudes of the different contributions. The first property to consider 
is the full generality, as long as <pi — <p 2 ^ mod [n], of Eq. ([T]), i.e. any 
additional contribution M 3 e ±ll ^ 3 e lS3 can be recast into the previous form as 



and thus 



e ±ifc = sm^ 3 - (p 2 ) e±Uh + sin^s ~ gxj e±i4>2 ^ (2) 
sin(0i-0 2 ) sin(0 2 -0i) 



A' = A + M 3 e +i ^e iSs = M[e +i(pl e i5 '^ + M^e+^e 16 * , 

A 1 = A + M 3 e- itf,:i e iS3 = M[e- i(t>1 e iS i + M'^ 2 e iS * , (3) 



with 



M[e^ = Mie^+M z e ^ m 03 ~ 02 , 

sm(0i-0 2 ) 

Af 2 e* = M 2 e^ + M 3 e^ Sin S 03 ~ 01 | . (4) 

sm(0 2 - (pi) 

We can also use Eq. (T5]) to change our basic set 4> 2 } of weak phases to any 
other arbitrary set of weak phases {<pi, ip 2 }, as long as (fi — (p 2 ^ mod \k\: 

A = Mi e +itpi e iAl +M 2 e+^ 2 e iA2 , 

A = Mi e~ itpi e iAl +M 2 e~ ilp2 e iAi , (5) 

where 

Mie iAl = Mie ^ ^i-^) 2 ^ 2 sin(0 2 -^ 2 ) 
sm(v9i - ip 2 ) sm(v9i - (p 2 ) 

M 2 e^ = Mie ^^-^\ + M 2 e ^ 2 -^\ . (6) 
sm(y? 2 - tpx) ' sm(y? 2 - tp x ) 



1 If the final state is ±1 CP eigenstate, A should include an additional ±1 factor. 
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This change in the basic set of chosen weak phases should have no physical 
implications, hence the name reparametrization invariance. We remind two 
main consequences of Rpl in the absence of hadronic inputs. For an extensive 
discussion see [5]: 

1. Consider two basic sets of weak phases {0i, 02 } and {0i, ^2} with 2 7^ 
if2] if an algorithm allows us to write 02 as a function of physical 
observables then, owing to the functional similarity of equation (CTJ and 
([5]), we would extract y?2 with exactly the same function, leading to 
02 = ¥2, i n contradiction with the assumptions; then, a priori, the 
weak phases in the parametrization of the decay amplitudes have no 
physical meaning, or cannot be extracted without hadronic input. 

2. If, experimentally, the direct CP asymmetry C = (\A\ 2 — \ A\ 2 )/(\A\ 2 + 
I A\ 2 ) is C = 0, then the decay amplitudes can be expressed in terms of a 
single weak phase, which could be sensibly extracted, up to discrete am- 
biguities, through the indirect CP asymmetry S = 2 Im(74A*)/(|74| 2 + 
\A\ 2 ). Additionally, if the theoretical description of the decay ampli- 
tudes only involves a single weak phase from a basic Lagrangian, then 
it can be identified with the phase measured through S. 

As we will see, this two results apply respectively to the 7i + 7t~ and 7r + 7r° 
channels. Essentially, the first one will be operative in the AI = 1/2 piece 
and the second one in the AI = 3/2. 



2.2 Removing 7r°7r° information 

To make our point transparent we will start by studying the extraction - in 
fact the non- extraction - of a from ttti data when B — > 7r°7r° experimental 
information is removed. Let us start with a widely used [2,3], Standard 
Model inspired, parametrization of the decay amplitudes: 

A + _ = A(B° d -> 7T+7T-) = e~ ia T + - + P , 
V2A +0 = V2A(B + -> tt+tt ) = e~ ia (T + ~ + T 00 ) , 
V2A 00 = V2A{B° d -> ttV) = V2A +0 - A + _ = e~ ia T m - P , 

ee A(B° d -> tt+tt-) = e +ia T + ~ + P , 
V2A +0 ee y/2A(B~ -> tt-tt ) = e +ia {T + ~ + T 00 ) , 

V2A 00 EE >/2A(SS -> 7T 7T ) EE v^A+o - A + _ = e^T 00 - P . (7) 

When 7r°7r° experimental information is removed we have two decoupled de- 
cays: 
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1. 7r + 7T° data, i.e. the average branching ratio B +0 and the direct CP 
asymmetry C +0 , provide, respectively, \T + ~ + T 00 | and a consistency 
check C +0 = 0; a is irrelevant there. 

2. 7r + 7r~ data, i.e. , C + ~ and the mixing induced CP asymmetry 
S + ~ , give information on a decoupled from 7r + 7r°, on |T + ~|, \P\ and 
the relative (strong) phase Spt+- between T H and P. 

With three observables and four parameters everybody knows or suspects 
that one cannot really extract a: we have C + ~ ^ 0, as reminded in section 
12.11 a cannot be extracted from B — > 7r + 7r~ in this limited case. One can try, 
nevertheless, to obtain a probability distribution function (PDF) for a as in 
reference [2]. This PDF, obtained in an analysis with three observables and 
four unknowns, has obviously a strong dependence in the priors, as in figure 

2 of [2]. Even worse, reparametrization invariance [5] tells us that , A + _ 

can also be written as 

A + _ = e - ia 'T + - + P', = e +ia 'T' + - + P' , (8) 

where a' is any weak phase - known or unknown, a' ^ 0, 7r -. In this scenario 
the conclusion is clear: any information one would get for a would also be 
valid for any a' and thus it cannot be assigned to a. This solves the puzzle 
raised in the MA and RI parametrizations within figure 4 of reference [2]: 
those PDFs cannot be attributable to a. Just with that data alone we cannot 
extract a' - whatever it is -, as we have emphasized in 12.11 To illustrate this 
issue we compute the PDFs of figure [1] in the following parametrization: 



A+- 


= A{B° d -> 7T+7T-) = e~ ia 'T + - + 


P, 


V2A +0 


= V2 A(B + -> tt+tt ) = e- ia {T~ 


h- _|_ rpOO 


V2A 00 


= y/2 A(B° d -> vrV) = V2A +0 - 




A + - 


= A(B° d -> tt+tt-) = e +ia 'T + - + 


P, 


V2A +0 


ee V2 A(B~ -> tt-tt ) = e +ia {T~ 


h- _|_ rpOO 


V2A 00 


= >/2 A(B° d -> = V2~A +0 - 





(9) 

Notice that just with a = a', Eq. © recovers the parametrization in Eq. ((7j). 
The phase of T + ~ is set to zero (i.e. all strong phases are relative to arg(T + ~)) 
and flat priors are used for all the parameter^!, that is, moduli |T + ~|, |T 00 |, 

2 The allowed ranges for the different moduli and the sensitivity to them in this and 
other cases will be addressed later, for instance, for this example, they are all limited to 
lie in the range [0; 10] x 10~ 3 ps -1 / 2 . 
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\P\ and phases 5p = arg(P), 5 = arg(T 00 ), a and a'. Results in other 
parametrizations, being equally illustrative, are relegated to appendix O 




25 50 75 100 125 150 175 25 50 75 100 125 150 175 

(a) a PDF (b) a' PDF 





25 50 75 100 125 150 175 2 5 50 75 100 125 150 175 

(c) Joint (a', a) PDF (d) a = a' PDF 

Figure 1: PDFs of a and a' from B — > tttt without tc°tt data. 



The lesson of this example is rather obvious: the set of observables being 
insensitive to a, its PDF is uninformative (just the flat prior in this case); 



the PDF in figure 1(d) , erroneously identified with a, is nothing else than a' 
itself, whatever it could be. 



2.3 Including back n n information 

When we incorporate B — > 7t 7t° data to the isospin construction, |v4oo| (|^4oo|) 
gives the angle among A + q (A + q) and v4 + _ (A + _); using then the known 
phase difference between A + _ and the angle among A +0 and A +0 is 

obtained. This is just the isospin analysis giving a. Knowing a, i.e. with a 
fixed, A + _ = e ~ ia T + ~ + P would have full meaning and C+_, £+_} 

would fix the three hadronic parameters. Unfortunately the isospin analysis 
as explained above yields allowed values for a spanning a wide range. The 
degeneracy of solutions together with the experimental errors do not fix a, 
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just exclude some region. In this situation {B + _ , C + _, S+_} do not really fix 
the hadronic parameters and, consequently, they tend to generate a spurious 
PDF for a as we have seen. The final "a" is thus a sort of convolution of the 
a obtained from the isospin analysis and the spurious one "extracted" purely 
from 7r + 7r~ data. This is illustrated with the PDFs of figure EJ making use 
of the parametrization in Eq. 0. 
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(a) a PDF 



25 50 75 100 125 150 175 

(b) a' PDF 
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25 50 75 100 125 150 175 



25 50 75 100 125 150 175 

(d) a = a' PDF 



(c) Joint {a 1 , a) PDF 

Figure 2: PDFs of a and a 1 from B — > 



To stress the importance of this issue we repeat the previous example 
while arbitrarily reducing all experimental uncertainties by a common factor 
of 5. The PDFs corresponding to this fake scenario are displayed in figure El 

The results shown in figures [U [2] and [3] deserve some comment: 



1. Figures 1(b) and 2(b) are almost identical; in the former we were not 
using B — > /T°7r° information while in the later we were doing so. This 
similarity is a dramatic illustration of the spurious nature of the "ex- 
tracted" a'. 



2. Figure 2(d) is the cut of the joint PDF in figure 2(c) along the line a 



a'. Therefore the so called MA extraction of a is a sort of convolution 
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2 5 50 75 100 125 150 175 25 50 75 100 125 150 175 25 50 75 100 125 15 175 



(a) a PDF (b) a' PDF (c) a = a' PDF 

Figure 3: PDFs of a and a' from B — > tttt with experimental uncertainties 
reduced by a factor of 5. 



of the Gronau-London a - figure 2(a) - and the spurious one. 



3. This a' PDF basically allows any value of a' except the neighborhoods 
of and n, which are a priori forbidden by S~* , C H ^ 0: obviously 
there is no way to produce CP violation in the 7r + 7r~ channel without 
two weak phases in the amplitude that controls it. The exclusion of 
a' = 0, 7r is the only physical information one can extract in the SM 
from the PDF of a'. 

4. The deep in the a distributions around a ~ 7r/4, which is transmitted 
to the a = a' PDF, is senseful. The exclusion of a ~ 0, tt is also 
physical inside the SM. Nevertheless, how strongly these 0, 7r regions 
are excluded is highly sensitive to the allowed ranges for |T H |, |T 00 | 
and \P\- see section H]-. As we move away form the a = 0, ir points, the 
final PDF of a would be more influenced by the spurious a' distribution. 
One can see that in the shape of the a distribution for a < 25° or 
a > 75°. 

5. As uncertainties are reduced, even with a = a', the valid ranges for 
the "real" a emerge, despite the a' distribution. That is, as experimen- 
tal uncertainties are reduced, the a' "pollution" of a through a = a' 
becomes increasingly ineffective, as it should, and just transmits the 
physical exclusion of a = 0, ir inside the SM. 

The main lesson from the previous example is: a is obtained from purely 
AI = 3/2 amplitudes, without additional hadronic input. Including it in 
AI =1/2 pieces, as reparametrization invariance shows, pollutes the legiti- 
mate extraction with information that one cannot claim is concerning a. 
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3 Standard Model inspired parametrizations 



As stated above, following the consequences of reparametrization invariance, 
the really legitimate sources of our knowledge on a are A +0 , A +0 . We have 
referred to the parametrization in Eq. ([7j) as a "SM inspired parametrization" 
of the amplitudes and we have discussed how the inclusion of a in A + _, A + _ 
is dangerous with present uncertainties. Nevertheless, it is clear that the 
exclusion of a ~ 0, 7r inside the SM is a valid physical consequence that comes 
from having a in A + _ and To further illustrate the importance and the 
subtlety of this issue let us consider in detail what can be interpreted as a 
"SM inspired parametrization" . Once we take into account reparametrization 
invariance, we only neecjf) to focus on A + _ and A + _: 

1. Rpl allows us to write {A + ^, A + -} in terms of any pair of weak phases 
{4>i, 2 } (as long as 0i — 02 7^ mod [ir]), nothing enforces the use of 
{0,a}. 

2. SM compliance of any parametrization only requires that the vanishing 
of all the SM phases leads to no CP violation, once again nothing singles 
out or requires the use of {0, a}. 

Consequently, as we have at our disposal other SM phases that we can choose 
to parametrize A+-, namely!^, /3, x, x' i instead of A + _ = e~ ia T + ~ + P 
and A^ = e ia T + ~ + P, we can for example write, on equal footing, 

A + _ = M ie iSl e- ix + M 2 e iS2 e- i/3 , A + _ = M ie i5l e +ix + M 2 e iS2 e +i ? , (10) 

or 

A + _ = e~ ix T + ~ + P, A + _ = e +lx T + - + P . (11) 

Within the SM x ~ @{^ 2 ), had we used this last parametrization (Eq. (TTTh ) . 
we would have found extreme compatibility problems^ that would be absent 
with another SM inspired parametrization: this is a dramatic illustration 
of the consequences of Rpl mentioned in section 12.11 In other words, pre- 
tending that one obtains information on SM "theoretical" phases just by 
parametrizing A + _ and A + _ with them is in general senseless. In this case 



we would have obtained that figure 2(b) is the PDF of the phase x-, the one 
that appears in B s -B s mixing [8-13]. 



3 A + q and A + q can be parametrized with a single weak phase, identifiable with a, Aqq 
and Aqq will follow from the isospin relations. 

4 7 = wg(-V ud V cb V: b V; d ), H = wg(-V cd V tb V c lV; d ), X = ^(-^.V^) and x ' = 
^(-V us V cd V: d V c %) [6]. 

5 Just look, for example, to the 0(X 2 ) ~ 2 — 3° region of the different a' PDFs in the 
plots of previous sections [7] . 
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4 Physics and parametrical problems 



In section |2] we mentioned that the exclusion of the "dangerous" a' near and 
7r depended on the allowed ranges for the parameters \T^\ and \P\. Figure H] 



shows the PDFs of a, a' and a 



a' for four different sets of allowed ranges of 

and 



\T lJ 1 and \P\. On the one hand, the PDFs of a in figures 4(a) , 4(d) , 4(g 



4(j) are quite similar. On the other hand, the PDFs of a 1 in figures 4(b) , 4(e) 
4(h) and 4(k) are completely different: the "dangerous" a', especially in the 



regions close to 0,7r, is sensitive to the applied bounds. This is automatically 
transmitted to the a = a' PDF and it is in this way that the region with 
"a" close to 0,7r is suppressed (even wipped out as in figures 4(c) and 4(i)) 
through the cuts on the spurious a', induced by the cuts on |T U | and \P\. 
One could think that this is particular to the bayesian statistical approach, 
figure [5] shows the frequentist confidence level curves for a computed under 
the same parametric restrictions. As we use the parametrization of Eq. (J7J), 
they correspond to the a = a' plots in Figure HI It is rather clear that 
without regard to the statistical approach, limiting the values of \T^\ and 
\P\ has observable effects in the extraction of a. Note that figure 5(c) differs 
from figure 5(a) not by a cut but by a change in the shape, even if it is not 
a dramatic change. 

The authors of reference [2] pointed out that there is some peculiar limit 
with a — > together with P/T + ~ , T 00 /T + ~ — > —1, |T + ~| — > oo - using the 
parametrization of Eq. ([7]) - that keeps all the observables "in place" : it is 
in fact a question of having a' — > rather than a — > 0. This peculiar limit 
is useful to understand the a ~ 0, tt exclusion above mentioned. To obtain 
parameter configurations with high likelihood when approaches or n, 
the required values of \T^\ and \P\ are increasingly large. Imposing bounds 
on \T^\ and \P\ automatically limits how close to 0, n one can push the weak 
phase while producing likely branching ratios and asymmetries. The use of 
the parametrization in Eq. ([9]) shows how this works for the dangerous a' 
and is then transmitted to a. 
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(a) a PDF (b) a' PDF (c) a = a' PDF 

Allowed ranges: \T ij \ E [0; 10] x 1(T 3 ps" 1 / 2 , \P\ E [0;2.5] x 10~ 3 ps" 1 ^ 





25 50 75 100 125 150 175 25 50 75 100 125 150 175 



25 50 75 100 125 150 175 



(d) a PDF (e) a' PDF (f) a = a' PDF 

Allowed ranges: \T^\ E [0; 10] x 10~ 3 ps" 1 / 2 , \P\ E [0; 10] x 10~ 3 ps" 1 / 2 
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25 50 75 100 125 150 175 25 50 75 100 125 150 175 25 50 75 100 125 150 175 



(g) a PDF (h) a' PDF (i) a = a' PDF 

Allowed ranges: \T»\ E [0; 5] x 10" 3 ps" 1 / 2 , \P\ E [0; 1.25] x 10~ 3 ps" 1 / 2 
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25 50 75 100 125 150 175 25 50 75 100 125 150 175 



25 50 75 100 125 150 175 



(j) a PDF (k) a' PDF (1) a = a' PDF 

Allowed ranges: \T^\ E [0; 25] x 10~ 3 ps^ 1 / 2 , \P\ E [0; 25] x 10~ 3 ps" 1 / 2 

Figure 4: PDFs obtained using the parametrization in Eq. ([9]) and different 
allowed ranges for |T 4J | and \P\. 
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25 50 75 100 125 150 175 25 50 75 100 125 150 175 

(a) \Tv\ < 10, \P\ < 2.5 (b) \T^\ < 10, \P\ < 10 




25 50 75 100 125 150 175 25 50 75 100 125 150 175 



(c) \T^ \ < 5, \P\ < 1.25 (d) \Tv\ < 25, \P\ < 25 

Figure 5: a CL; as usual \T lj \ and \P\ in units of ps -1 / 2 . 
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5 The extraction of ol from B — > tttt and New 
Physics 

Recently the UTfit collaboration has proposed to add information on the 
moduli of the amplitudes in order to extract a inside the SM. In particular, to 
add reasonable QCD based cuts on the moduli of T iJ and P. Even if we agree 
with this procedure, we must stress that the resulting PDF of a - see figures 



4(c) or 4(i) - in the non zero region mixes AI = 3/2 information with spurious 
AI = 1/2 information. In this case it does not seem dramatic, but it can be so 
in the B — > pp case - see [2] -. In addition, if one is trying to make a general 
fit of the SM it is more natural to use the AI = 3/2 piece of B — > tttt to 
get reliable bounds on a and once a is fixed by the general unitarity triangle 
analysis, use the AI = 1/2 piece of B — > tttt to obtain better information 
on the hadronic parameters. In fact, the UTfit collaboration presents results 
along this line in [3]. This implies our recommendation of using a in the A +0 
amplitude and another phase in or in the A J = 1/2 piece. 

After confronting the SM a la CKM with data, the most important ob- 
jective in overconstraining the unitarity triangle is in fact to look for New 
Physics (NP) [8-16]. When there is NP - just in the mixings or also in the 
AI = 1/2 decay amplitudes^] - it is not appropriate to use a SM inspired 
parametrization. In the limit where all SM phases go to zero, C + ~ and S -1 
can still be reproduced by NP loops. So, if we want to interpret the a PDF 

a we have to use a different CP- violating phase in the AI = 1/2 piece 
or in Parametrizations that fulfill these requirements are the so-called 

PLD, ES, the V parametrization in [2] and even our SM-like parametrization 
with a' in Eq. © despite having one more parameter. A similar one, which 
additionally factorizes an overall scale of the amplitudes, is the following, 
that we call Ti': 

A+- = e- ia T 3/2 {T + iP), V2A 00 = e~ ia T 3/2 (l - T - iP), 
V2A +0 = e~ ia T 3/2 , V2A +0 = e+ ia T 3/2 , 

A + _ = e + -T 3/2 (T-*P), V2A 00 = e+ ia T 3/2 (l-T + iP). 

(12) 

Notice that a global weak phase in A^ is irrelevant in C + and amounts to 

a global shift of axg(A + _A* + _). 

In this section we will "extract" a in a bayesian approach making use of 
different parametrizations; we will show the consistency of all those results 



6 With great accuracy - up to small electroweak penguins - this case corresponds to 
having NP everywhere except in tree level amplitudes. 

7 Where a — tt — (3 — 7, (3 — j3 ~ <f>d and the NP phase in B®~B® mixing is defined by 
Mf 2 = r 2 d e- l2 ^[Mf 2 } SM . 
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and then compare to frequentist results. From a fundamental point of view, 
as stressed in previous sections, we are not willing to use information beside 
assuming the triangular isospin relations, the single "tree level" weak phase of 
the AI = 3/2 piece and experimental results themselves. Reparametrization 
invariance and the presence of a single weak phase, a, in the AI = 3/2 
amplitudes A +0 and A +0 imply that all the results to be presented in this 
section will be valid in the presence of New Physics in loops. 

Figure [6] shows the PDF of a in three different cases: the 'PLD' [17] and 
'li' (Eq. (TI2"]) ) parametrizations, and the explicit extraction (as in [17] or [5]). 
Corresponding 68%, 90% and 95% probability regions are displayed in table 
CD, together with the frequentist 68%, 90% and 95% CL regions (in the fol- 
lowing, frequentist calculations are carried with the 'PLD' parametrization). 
These regions are represented in figure [7J Despite some small differences in 
the 68% regions, somehow expectable as they are more sensitive to details, 
the results are consistent, they coincide rather well. B — > nn data are still 
too uncertain to really provide important constraints on a, the only relevant 
feature being the exclusion of the a ~ 7r/4 region, which could be understood 
(see section IC.ll in appendix £3 in terms of the smallness of B 00 . 




25 50 75 100 125 150 175 

(c) Explicit extraction 



Figure 6: a PDFs. 
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OO/O 


yu/o 


yo/o 


PLD 


[0;5]° U [85;101]°U 
[121; 150]° U [168; 180]° 


[0;8]° U [82;107]°U 
[114; 157]° U [162; 180]° 


[0;9]° U [82; 110]° 
U[113;180]° 


li 


[95; 174]° 


[0; 1]° U [89; 180]° 


[0; 5]° U [85; 180]° 


Explicit 


[2; 8]° U [82;88]°U 
[100; 120]° U [125;145]°U 
[150; 170]° 


[0;9]° U [81; 91]° 
[95; 175]° U [179; 180]° 


[0; 10]° U [80; 180]° 


CL 


[0; 7]° U [83; 104]° 
[115; 154]° U [166; 180]° 


[0; 12]° U [78; 180]° 


[0; 14]° U [76; 180]° 



Table 1: a regions within [0; 180°]. 



68% 



90% 



95% 



25 50 75 100 125 150 175 

Figure 7: a regions (the ordering, top to bottom, is in each case: 'PLD' pa- 
rametrization, 'li' parametrization, Explicit extraction and frequentist anal- 
ysis). 
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Conclusions 



To our knowledge the discrepancies between frequentist and bayesian ap- 
proaches using the so-called MA and RI parametrizations with Eq. (JTj) have 
not been previously understood. We explain that with present experimen- 
tal uncertainties it is extremely unsecure to introduce the phase a in the 
AI =1/2 piece. To a great extent a spurious PDF of a tends to be gener- 
ated. The Gronau and London analysis is critically based on the appearance 
of one weak phase in the AI = 3/2 piece = 0). Introducing a in the 

AI = 1/2 piece - or A + _ - ^ 0) brings this "second" a to the category 

of 'not observable' even if one is using a Standard Model inspired parametri- 
zation. This difficulty is operative in the so-called MA and RI parametriza- 
tions. The introduction of a in the AI = 1/2 piece and some QCD-based 
bounds on the amplitudes allows - as done by the UTfit collaboration - to 
eliminate the solutions around a ~ 0, ir inside the SM. The PDF can still be 
partially contaminated with the spurious a distribution. In B — > im it is not 
dramatic but it could be so in other channels. This last procedure cannot be 
applied to an analysis with NP in loops. Therefore, we strongly recommend 
to use parametrizations where a is just included in the AI = 3/2 piece. We 
partially agree with the UTfit collaboration that, in spite of the differences 
among the frequentist and bayesian methods, both approaches give similar 
results if one uses parametrizations with a clear physical meaning. In this 
sense the most relevant result is the exclusion of the region a ~ 25° — 75°. 
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A Inputs and numerical methods 



Along this work we use the set of experimental measurements [18-28], com- 
bined by the Heavy Flavour Averaging Group [29] , in table [2j 





7T7V 


Bt° 

7T7V 


5.2 ±0.2 


1.31 ±0.21 


5.7 ±0.4 




q+- 


^00 
°7T7r 


-0.39 ±0.07 


-0.59 ±0.09 


-0.37 ±0.32 



Table 2: Experimental results, branching ratios are multiplied by 10 



In terms of B — > txtx amplitudes, 

\A^l + \A^l \AV\ 2 -\AV\ 2 i7 - _ 2 Un(.\>i.\'f ) 

~ TB * +J 2 ' \A i i\ 2 + \A i j\ 2, " \Av\ 2 + \Av\ 2 ' 

(13) 

All frequentist CL computations are performed by: (1) minimizing % 2 with 
respect to all parameters except the one of interest which is fixed (in this case 
a), (2) computing the corresponding CL through an incomplete T function. 
All bayesian PDFs are computed using especially adapted Markov Chain 
MonteCarlo techniques. 

B Experimental results and isospin relations 

The isospin relations 

A + _ + V2A 00 = V2A +Q , 

A + - + V2A 00 = v^A+o, (14) 

define two triangles in the complex plane whose relative orientation fixes a. 
The sizes of the different sides follow from Eq. (Fl4l) . 





V2\A 00 \ 


V2\A +0 \ 




V2\A 00 \ 


V2\A +0 \ 


1.441 


1.040 


2.634 


2.176 


1.533 


2.634 



Table 3: Numerical values of the sides of the isospin triangles computed with 
experimental central values, to be multiplied by 10~ 3 ps~ 1//2 . 
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This allows the reconstruction, up to a number of discrete ambiguities - 
namely up to eight -, of both triangles. Central values of present measure- 
ments yield the values of the sides in table El One straightforward question 
is mandatory: do those would-be triangles "close"? The answer is in the 
negative because 

\A+-\ + V2\A 00 \ = 2.481 ? 2.634 = V2\A +0 \ , 
\A+-\ + V2\A 00 \ = 3.709 > 2.634 = V2\A +0 \ . 

In fact, for those central values, the first triangle is not a triangle [30]. In 
terms of likelihood, the closest configuration to that situation, the most likely 
one, is having the first triangle flat, a feature which naturally explains the 
reduced - by a factor of two, from eight to four - degeneracy of a "solutions" . 
That is, while for old data the almost flatness of this same isospin triangle 
yielded eight different solutions distributed in four almost-degenerate pairs, 
those pairs are now degenerate and rather than exact solutions for the central 
values of the observables they produce best-fitting points. 

Consequently, the use of explicit solution constructions requires the rejec- 
tion of the joint regions of experimental input incompatible with the isospin 
relations Eqs. (114j) . For old data, this meant rejecting some 48.2% of allowed 
experimental input (weighting each observable with a gaussian with mean 
and standard deviation given by the corresponding central value and uncer- 
tainty), for the new data set this rejection rate is 70.9%. In the bayesian 
and frequentist treatments the isospin relations are assumed valid and all 
the subsequent analyses are "normalized" to that assumption. 

C Removing B — ► 7r°7r° information 
C.l Explicit extraction of ol 

This appendix is devoted to some complementary results extending what is 
presented in section 12.21 The first issue we will address is the explicit extrac- 
tiorE of a when B — > 7r°7r° information is removed, that is, no knowledge of 
B 00 and C 00 . The explicit extraction of a assumes the isospin relations in 
Eqs. (114j) so to start with, the ignorance on B 00 is not "just plain ignorance" 
(whatever this could stand for) as it will operatively mean that for any ex- 
perimental set of results {B + ~ , B +0 , C + ~, }, B 00 and C 00 should be such 

8 Bcside the explicit formula for a in terms of the available observables presented in 
reference [17] we also make use of the extraction of a explained in [5]; the results are com- 
pletely equivalent, however the later does not make any use of a particular parametrization 
of the amplitudes and is easily interpreted in terms of the isospin construction. 



18 



that both would-be isospin triangles are in fact isospin triangles. C 00 is obvi- 
ously restricted to be in the range [— 1; 1]; what about B 00 7 One could argue 
that if there is no information on B — > it should be smaller than a given 
bound or one can just let it be as large as allowed by other data and isospin 
constraints. This rather trivial fact is apparently at the origin of the discrep- 
ancy in the results presented in references [2,3] for the explicit extraction of 



a "without" B 



information: figure [H] shows two PDFs of a. They 



are obtained by generating known experimental sets {B + ~ , B +0 , C + ~ , } 
according to gaussian distributions with central values and standard devia- 
tions given by the quoted measurements and uncertainties (C + ~ and S + ~ are 
also restricted to be within [— 1; 1]), then C 00 and B 00 are generated through 
flat distributions, C 00 in the range [—1; 1] and B 00 in a range [0; B ® Iax }. Sets 
{B + -,B +0 ,C + -,S + -,B 00 ,C 00 } which fulfill the isospin relations Eqs. fl33} 
are retained and used to extract a. The PDFs of a represented in figure M 



only differ in the value of B^ ax , Fig. 8(a) was obtained with B^ ax equal to 



two times the present measurement while Fig. 8(b) was obtained with B 



)00 
'Max 



equal to twenty times the present measurement. On the one hand, the PDF 



in figure 8(a) coincides with the one presented in figure 4 'ES' of reference [2]; 



on the other hand the PDF in figure 8(b) agrees, more or less, with figure 
4 of reference [3]. It is now clear that the difference among both may be 



just due to the numerical procedure. Figure 8(b) shows that the removal of 
B — ► 7T 7r° information leads to a loss of knowledge on a. Ironically, there is 
a lesson in this example: numerics apart, the smallness of B 00 is responsible 
for the exclusion of values a ~ tt/4. 
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25 50 75 100 125 150 175 

(a) a PDF 



25 50 75 100 125 150 175 

(b) a PDF 



Figure 8: Explicit extraction without B — > 7r°7r°; lighter curves correspond 
to the different individual contributions related by the discrete ambiguities. 
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C.2 Parametrizations 

To complete the picture we now proceed to repeat the extraction of a when 
B — > 7r°7r° information is removed in several parametrizations. We will make 
use of the 'PLD' parametrization [17], of the 'li' parametrization with fixed 
weak phases in {A + _, A + _} (Eq. (TT2"]) ) and, finally, of the parametrization in 
Eq. @ but in this case, apart from a and a', instead of moduli and phases we 
will use real and imaginary parts of T + ~ , P and T 00 (the RI parametrization 
in reference [2]). The PDFs of a obtained for the first two parametrizations 
are shown in figure El they are eloquent: no knowledge on a. 




25 50 75 100 125 150 175 25 50 75 100 125 150 175 



(a) a PDF, PLD parametrization (b) a PDF, li parametrization 

Figure 9: Extraction without B — > 7r°7r°. 



For the RI parametrization we show the PDFs of a, a' and the one 
obtained by setting a = a' in figure QUI Once again it is clear that there 
is no information on a and that inappropriately insisting on including it in 



{A + _, A + _} produces the senseless result of figure 10(c) 





25 50 75 100 125 150 175 



25 50 75 100 125 150 175 



25 50 75 100 125 150 175 



(a) a PDF (b) a' PDF (c) a = a' PDF 

Figure 10: Extraction without B — > tt°7T , RI parametrization. 



The conclusion of this appendix is straightforward: just dealing with 
a reduced scenario in which B — > -k°7t° information is removed, a proper 
understanding of the subtleties involved in the parametrization of B — » im 
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amplitudes avoids peculiar results as for instance the 'MA' and 'RT ones 
included in figure 4 of reference [2]. We have shown here that starting with a 
flat prior for a consistently gives highly non-informative posteriors in several 
sensible parametrizations. 

D Using the RI parametrization 

In section 12.31 we used the parametrization in Eq. ([9]) to obtain figure [2] with 
flat |P +_ |, |P|, |T 00 |, arg(P), arg(T 00 ), a and a' priors. For completness 
we also show - figure [11] - the PDFs of a, a' and a = a' in case one uses 
flat |T+"|, Re[P], Im[P], Re [T 00 ] , Im[T 00 ], a and a' priors. Beside the 
effect of the spurious a' in the PDF of a = a', we can also appreciate the 
influence of the change in the priors: the integration domain is the same as in 
figure [2] but the integration measure is now different. The main effect is the 
relative enhancement of the contributions from regions with large parameters, 
including the contributions from the a' — > driven region. 




Figure 11: a extraction, RI parametrization. 
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E One short statistical comment 



Leaving completely aside philosophical aspects of probability, both frequen- 
tist and bayesian approaches start with a common likelihood function. Each 
approach reduces the information provided by the likelihood function in a 
different manner. Consequently, they do not yield strictly coincident results: 

• Bayesian posteriors obviously depend on the priors, for example the 
allowed ranges or the shape. As we have seen, we obtain different 
posteriors with different priors. However, as long as one is using sen- 
sible parametrizations and reasonable priors, we end up finding rather 
compatible results. 

• Frequentist CL curves do depend on the parametrization, to be precise, 
they depend on the allowed ranges for the parameters; once sensible 
parametrizations and adequate ranges are used, CL curves obtained 
with them are identical. The a — > limit in the SM inspired parame- 
trization of Eq. (J7|) illustrates this issue. 

Beside those well known issues, we may find troublesome that: 

1. Most probable values in the bayesian PDFs do not coincide with the 
analytical solutions for a. 

2. Intimately related to this aspect, bayesian PDFs seem unable to dis- 
tinguish among degenerate solutions. 

We remind that these statements concern one dimensional PDFs of a. Fre- 
quentist one dimensional CL curves distinguish a solutions because they are 
obtained through best fitting points for fixed a. Bayesian PDFs do not dis- 
tinguish them as the uncertainties produce distributions for the degenerate 
solutions which overlap and add up in the complete PDF. One can still have 
a hint of the proximity of different solutions from this kind of overlap, but 
this is not the point here. For reduced experimental uncertainties, bayesian 
PDFs would not overlap and would distinguish among those different solu- 
tions. This could be sufficient to think that, per se, there is no discriminating 
advantage in using one or the other approach. With present uncertainties, 
bayesian analyses seem incapable of pinning down the right location of the 
solutions in a and telling us something about their degeneracy. It is not a 
fundamental problem of bayesian methods as reduced uncertainties would 
overcome these "difficulties". If it is not a fundamental problem, could we 
somehow overcome these "difficulties" with present uncertainties? The an- 
swer is in the positive as the problem only arises because we are insisting 



22 



in the reduction of the available experimental information to obtain one- 
dimensional PDFs of a\ let us take a look to the joint PDFs in figure [T2"l 
These are the joint PDFs of (5, a) and (a e ff,a) obtained with the 'PLD' 
parametrization. They are quite illustrative, one can see the different solu- 
tions in a concentrated around the values of a dictated by the analytical 
expectations. The pretended fundamental drawbacks of bayesian methods 
to adequately place and distinguish the solutions are just a consequence of 
pushing too far, for the present level of experimental uncertainty in the re- 
sults, the statistical "reduction of information process". A simultaneous look 
to both frequentist and bayesian results will not put an end to the statistical 
discrepancies, notwithstanding it will be very helpful to understand the phys- 
ical results we are interested in. Both approaches are "information reduction 
processes" and strictly sticking to one and deprecating the other may not be 
the wiser strategy. 




25 50 75 100 125 150 175 25 50 75 100 125 150 175 



(a) Joint (S, a) PDF (b) Joint (a eff ,a) PDF 

Figure 12: Joint PDFs obtained with the 'PLD' parametrization. 
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